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@ Method and apparatus for combining encoding and modulation. 

(g) A method and apparatus for combining encoding and 
modulation creates signal sets from available amplitude and 
phase modulations by indexing ordered subspaces. The 
subspaces need not be limited to the class of subspaces 
known as binary subspaces. The resultant signal sets, for a 
preselected power and bandwidth, are wrdely separated and 
unlikely to be confused by the effects of channel noise. Such 
signals can be in either finite block or convolutional form, 
depending on the natural format of the desired transmission. 
Further according to the invention are basic apparatus for 
encoding and modulating as well as demodulating and 
decoding a signal in accordance with the invention. Specifically, 
a method is provided for decoding that incorporates a specfflc 
type of decoding/demodulation techniques which develops 
accurate estimates of the information from the received signal 
in a computationally efficient manner and which permits high 
speed operation using soft-decision decoders. 
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Description 

METHOD AND APPARATUS FOR COMBINING ENCODING AND MODULATION 



5 BACKGROUND OF THE INVENTION 

This invention relates to digital communications and more specifically to techniques for constructing 
bandwidth efficient signal sets by combining error correcting encoding with modulation of digital data. More 
specifically, the invention relates to generalized methods and apparatus for encoding and modulating digital 
information signals and methods and apparatus for demodulating and decoding signals from an information 

10 channel containing information. The invention finds particular application to time-division multiple-access 
(TDMA) operation in a frequency-division multiple-access environment, such as a satellite transponder 
channel. 

In order to aid in identifying the relevance of the references cited herein, the references cited herein are 
referred to frequently by the abbreviations following the citations listed hereinbelow. 

15 In electronic data communication systems, random noise or interference can cause the transmitted signal to 
be contaminated and lead to errors in the received message. In systems where the reliability of received data is 
very important, error-correcting codes have been used to protect the transmitted message and enable system 
designers to reduce the effects of noise. Two major schools of thought and associated bodies of theory have 
emerged for performing this task: algebraic block coding, which relies heavily on the use of modern algebra 

20 and typically constructs codes as linear subspaces of a fixed size vector space over a finite field; and 
convolutional coding, in which the transmission is viewed as being continuous and the design typically relies 
more on computer search techniques and close analysis of the state diagram of the possible convolutional 
encoder circuits. 

For many years, the coding process was effectively separated from the problem of modulation in 
25 conventional systems. Modulation is the creation of, for example, electromagnetic signals in which changes in 
phase, frequency, or amplitude are used to distinguish different messages. 

Referring to Figure 1 representing a prior art system 10, in conventional systems 10 a block or stream of 
information digits 12 is fed into a digital encoder 14 designed for a specific error-correcting code where 
redundant check bits are added. The resultant encoded digits 16 are then fed into a modulator 18 where each 
30 digit or set of digits is typically mapped to a modulated symbol to be transmitted as information in for example 
a radio frequency signal 20. The radio frequency signal 20 is applied to a channel 22 wherein noise and 
interference 24 are added and then received as a signal with errors 26 at a demodulator 28. The demodulator 
28 attempts to extract from the signal with errors 26 redundant digits with errors 30 which are fed to an error 
correcting decoder 32 designed to accommodate the error correcting code. The decoder 28 then uses known 
35 redundancy structure of in the encoded digits 16 to eliminate as many errors as possible producing as its 
output estimated received digits 34. In some systems, the demodulator 28 also provides "soft decision 
information" or "reliability" information along with the estimate of the received digits which can be used 
effectively in a variety of error-correcting decoders to improve performance, particularly Viterbi decoders for 
convolutional codes. 

40 To maintain the separations between different messages guaranteed by the minimum Hamming distance of 
the error-correcting code, the mapping performed by the demodulator 28 must be chosen with care. (The 
Hamming distance between two words is the number of digits in which the two words differ. The minimum 
Hamming distance of a code is the minimum over all pairs of code words in the code of the Hamming distance 
between the two code words.) For example, in binary systems using phase-shift modulations, the 

45 correspondence between the redundant binary sequences and the particular phase of a transmitted signal is 
often dictated by a Gray code. 

The use of error-correcting coding in this manner frequently is an alternative to increasing the power of the 
transmitted signal to overcome the noise. Conversely, the use of coding permits the power of the transmission 
to be reduced with no degradation in the reliability of the transmission. The power savings obtained in this way 

50 are measured in terms of the allowable reduction in decibels of power-per-bit for the same bit error rate, a 
quantity referred to as "coding gain." However, since coding requires the addition of redundant digits, for a 
fixed modulation scheme the use of coding requires that symbols be sent at a faster rate, thereby increasing 
the frequency bandwidth occupied by the transmission. 
As the demand for communication links has increased, there has been growing competition for the available 

55 electromagnetic spectrum, and significant expansion of the bandwidth of the signal to reduce the power 
required has no longer been acceptable in many instances. Thus attention has turned to methods of 
combining coding and modulation into one coordinated mapping to achieve signals that are efficient in both 
power and bandwidth utilization. In the past, efforts have followed the two pathways set by error-correcting 
coding theory, with some building on the concepts of convolutional codes whereas others start from the block 

60 code ideas. 

In the convolutional school, a major step forward was made by Ungerboeck as described in his paper 
"Channel Coding with Multilevel/Phase Signals" [ung], in which he pointed out that the Euclidean distance 
properties of the electromagnetic signal space could be incorporated into the design of a convolutional code 
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encoder. Figure 2 illustrates the basic structure for comparison with Figure 1 , Using the trellis characterization 
of the encoder, i.e., a trellis encoder 44, information digits 12 are mapped directly to modulated signals 20, so 
as to add redundancy only when the electromagnetic symbols are likely to be confused. The error-correcting 
encoder and modulator are combined into a single coder/modulator herein called the treflis encoder 44. The 
standard Viterbi algorithm for decoding convolutional codes can be readily adapted to a so-called Viterbi trellis 5 
decoder 48 to decode the received symbols (signal with errors 26) directly to estimated information digits 34. 
In adapting the convolutional coding methodology, Ungerboeck chose not to "pursue the block coding aspect 
because the richer structure and omission of block boundaries together with the availability of Viterbi 
ML-decoding algorithm [sic ] make trellis codes appear to us more attractive for present coding problem 
[ung.p.58]." 10 

Others have followed Ungerboeck. For example, recently S. Q. Wilson has shown a construction for rate 5/6 
trellis codes for an 8-state phase-shift keying (8-PSK) system and has found tfiat it achieves an asymptotic 
gain of 6.2 dB over an uncoded 8-PSK system [wlsn]. 

Other researchers have pursued the construction of efficient coding/modulation systems from the algebraic 
block code point of view. Imai and Hirakawa showed how error-correcting codes of Increasing strengths can 15 
be coupled to increasingly error-sensitive parameters of the signal modulation in both multilevel and 
mutliphase modulations to give improved performance. Furthermore they explained a staged decoding 
method in which the most sensitive parameters are estimated first, using the^ajjqsteitiori prpbabJIIties based 
on the channel statistics and the code structure wherein those estimates are used in later probability 
calculations to determine estimates for the successively less sensitive parameters [i&h]. 20 

Similarly, V. V. Ginzburg has used algebraic techniques to design multilevel multiphase signals for a 
continuous channel. His methods address the case where the measure of distance in the continuous channel 
is monotonically related to an additive function of the distances between individual signal components. (Such 
additivity is commonly assumed in satellite channel models, for example.) He generalized the ideas of imai and 
Hirakawa by partitioning the set of elementary modulation signals into carefully chosen subsets that permit the 25 
actual channel distance between signals to be associated with the particular subsets in which the signals are 
found. He then combined a hierarchy of subsets with a matching hierarchy of codes of Increasing strength to 
design signal sets that are guaranteed to have large separations in the signal space. The algorithms he 
suggested for demodulating and decoding the signals has been given in only abstract mathematical terms: "A 
rigorous maximum-likelihood demodulation procedure of acceptable complexity may be built only in 30 
exceptional cases. A most simple approximate procedure implementing an energy distance D (i.e., one that 
leads to a correct decision for a noise energy < D/4) may be built as a sequence of integral reception 
procedures (to be carried out in the order of decreasing levels) for the individual codes that define the 
signal-system construction, if each of them implements D [gnz]. 

Most recently, Sayegh [syh] has developed further the methods of imai and Hirakawa by explicitly defining 35 
particular block codes that can be attached to the various levels of a hierarchy which admits to soft- decision 
decoding procedures, and he has demonstrated some achievable gains using his methods through simulation 
studies for very short codes. Sayegh's work is notable as well in that he has shown how Imai and Hirakawa's 
method can be combined with the signal set partitions of Ungerboeck to create combined coding and 
modulation systems based on several other signal constellations. Sayegh's work represents what Is believed 40 
to be the most relevant development to the present invention. However, Sayegh does not represent a prior art 
publication, since publication was less than one year prior to the filing date of the present application. 

Other authors [ck&sl] [frny86] have approached the problems of constructing bandwidth efficient signal 
sets using mathematically defined lattices. Thus, their work is distinguishable as essentially unrelated to the 
present scheme. 45 

Deficiencies in the Prior Art 

To understand the key advances of the present invention, it is important to understand the deficiency of the 
conventional practice of decomposing the process of creating the radio wave signal into separate processes 
of error-correcting encoding followed by modulation with, for example in the case of PSK, a Gray code; how 50 
the more recent methods ([i&h], [ung] and [gnz]) create a more effective linkage between the code structure 
and the demodulator and thus improve the quality of the constructed signal set; and how the new methods of 
the present invention present significant advantages in the effectiveness of the linkage both for creating high 
quality signal sets and facilitating computation of the estimation of the transmitted information by the receiver. 

The preponderance of the literature and theory on the construction of error-correcting codes addresses the 55 
problem of creating linear codes (e.g., [v&o, pp. 82-101]), which are well suited for protecting messages to be 
sent across a symmetric channel. For the symmetric channel, a central index of the quality of a code is its 
minimum Hamming distance, the minimum number of digits in which any two distinct code words must differ. 
The mathematical theory of codes such as BCH codes, which are output from an encoder to a modulator, 
attempts to guarantee that the minimum Hamming distance of the code In the (digital) vector space of the 60 
communication channel is as large as possible for a given code rate and block length, since the larger the 
minimum distance, the more errors that are certain to be correctable by a maximum likelihood decoding 
algorithm. 

However, on the continuous white Gaussian channel, most modulation schemes do not induce a symmetric 
channel from the perspective of the digital error-correcting coding system. Certain pairs of elementary 65 
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modulations are closer in the Euclidean space of the channel than are others, and thus they are more likely to 
be confused than others. At very high signal-to-noise ratios, the probability of error for a maximum likelihood 
decoder is directly related to the minimum Euclidean distance separating any pair of distinct transmitted 
signals. Thus, for an entire coding/modulation system, an important measure is the minimum Euclidean 
5 distance. 

To illustrate by reference to Figure 3, in the case of phase-shift modulations such as 8-PSK in conventional 
systems, the use of a Gray code mapping of binary triples to phase states serves the purpose of assuring that 
the Hamming distance of the error-correcting code is reflected in a reasonable lower bound on the Euclidean 
distance. In Figure 3 for a specific example, since by definition of a Gray code, the Gray code binary sequences 
10 associated with any pair of adjacent modulations differ in exactly one bit, the minimum squared Euclidean 
distance of a pair of signals in a two-dimensional signal subspace formed by a binary error-correcting code of 
minimum Hamming distance D and Gray coding must be at least D 5 2 , where 5 2 is the minimum squared 
Euclidean distance between modulation phases. 
This scheme is deficient because it creates a potential ambiguity due to the simultaneous occurrence of 
15 minimums. Specifically, the minimum Hamming distance separation in binary sequences can be achieved 
simultaneously with the minimum squared Euclidean distance of the modulation, thus creating a signal set for 
which a pair of signals is separated by a squared Euclidean distance that is the product of minimums, D 5 2 , 
which is the undesirable ambiguity. 

Both Ungerboeck and Ginzburg avoid the possibility of the simultaneous occurrence of minimums by 
20 coupling the coding to the modulation via a careful nested partitioning of the available modulations into 
subsets. In Ungerboeck's language, the mapping of binary sequences to modulations "follows from 
successive partitioning of a channel-signal set into subsets with increasing minimum distances: 
Ao < Ai < A2 ... 

between the signals of these subsets." Ungerboeck then attaches modulations of particular subsets directly 
25 to edges in a trellis encoder for a convolutional code in such away that the constraints of the trellis prevent the 
smallest Euclidean distances from occurring on all of the edges of the short paths originating and terminating 
at the zero state of the trellis. Effectively, this prevents a simultaneous occurrence of minimums in both the 
redundancy imposed by the trellis and the Euclidean separation of the modulation. 
Similarly, Ginzburg, in his hierarchical construction, defines partitions with L levels and associated minimum 
30 squared Euclidean distances satisfying: 
5i_ 2 < ... < A 2 2 < 6i 2 

(adapting Ginzburg's language to the present notation). He then associates a different error-correcting code 
with each level, the code for the 1th level having a minimum distance Di. (Note the difference between the 
variable 1 and the numeral 1 herein.) The squared Euclidean distance for the signals thus created must be at 
35 least: 

D * rain (6^ D 1 ) . 
1<1£L 

40 

Since the Di values are chosen to be larger for the 61 that are smaller, this minimum is much greater than the 
product of the minima. Likewise, the earlier technique of imai and Hirakawa may be viewed as a special 
instance of Ginzburg's signal construction method. 

45 In Ginzburg's method, the use of freely created nested partitions creates some limitations. At the 1*n level, 
there are actually a very large number of partitions, a different partition for each of the subsets created in the 
partition at the (1-1 )st partition. Due to the process of successive subdivision, the number of subsets of 
modulations that must be considered grows exponentially with the level, and the 81 associated with the 1th 
level is actually a minimum over all of the subpartitions at the 1th level. 

50 With regard to prior art decoding circuitry and methods as hereinafter discussed, all of the workers in the 
field previously cited have recognized the difficulty of decoding these high performance coding modulation 
schemes. Typically systems in which combined coding and modulation is contemplated are those where soft 
decision information is available. The most commonly used algorithm is the Viterbi algorithm, which can 
incorporate soft decision information into the calculation of path metrics. The path metrics are used to 

55 determine the best estimate of each of the transmitted bits. 

There is nothing in principle that precludes the use of the Viterbi algorithm in cooperation with 
encoding/modulation systems with encoders according to the invention as described herein. If all such 
encoders are convolutional, the encoder can take the form of a trellis encoder and the resultant signal can be 
decoded using the techniques suggested by Ungerboeck for trellis codes. In practice, however, the powerful 

60 codes created by the encoders of the invention as described herein can create redundancies which interrelate 
a large number of transmitted symbols and, when viewed as trellis codes, have an extremely large number of 
states. The number of states will in most instances make the Viterbi algorithm impractical. 

For the encoder/modulators based on block codes, the same problem is encountered. Most techniques for 
decoding block codes using soft decision information are very complex. For example, the Chase algorithm 

65 t ch l can be employed when the expected number of errors is very small, but the computational effort required 
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grows exponentially when the number of errors to be corrected increases. Similarly, it is possible to 
contemplate the use of Forney's generalized minimum distance decoding [fmy66J, but the complexity of this 
technique in practice is usually prohibitive. 

What is needed therefore is very efficient low-complexity algorithms. The present inventor as described in 
Tanner [tan81] has described algorithms which as described hereinbelow have potential interest In 5 
decoding/demodulation methods and apparatus according to the invention. The Tanner article Is therefore 
incorporated herein by reference and made a part hereof. Tanner's algorithms lead to a wide variety of decoder 
architectures, many of which are particularly well suited to parallel implementation in large scale integrated 
circuits. While the algorithms described in the Tanner article (e.g., Algorithm B) do not perform maximum 
likelihood decoding, they can be used effectively to decode much longer codes than can other methods, either 10 
large length block codes or large constraint length convolutional codes, without incurring prohibitive circuit 
complexity costs. Moreover, the sub-optimal Algorithm B, for example, can incorporate soft decision 
information into the decoding process for either block or convolutional codes with very little increase in the 
complexity of the decoding algorithm. In practice, decoding systems based on Tanner algorithms can 
outperform many Viterbi algorithm-based systems because the sub-optimaiity of the decoder performance is 15 
minor compared to the advantage of using a more powerful coding/modulation design. 

In most advanced modulation systems, the demodulator receives a single symbol that can be viewed as a 
vector in some symbol vector space. It then computes a distance measure for the, separation between the 
received symbol and ail of the possible elementary modulations. In so-called "hard decision" demodulators, 
the distance measure is heavily quantized to two implicit values. The closest elementary modulation is emitted 20 
as the best estimate (implicitly at distance 0) and the others are viewed as being at distance 1. In "soft 
decision" demodulators, the demodulator can put out numbers, typically viewed as quantized representations 
of real numbers, that indicate the likelihood that the received vector came from each of the possible 
elementary modulations. 

Making optimal use of this "soft decision" information as well as the constraints Imposed by the digital code 25 
structure is in general very difficult. While it is possible to contemplate the design of a decoder based on 
Tanner's algorithms that would use ail of the probability information provided by the channel for a 
sophisticated modulation scheme, heretofore nothing has been taught or suggested which could optimize the 
process in a simple fashion. 

Imai and Hirakawa proposed a technique for pure 2M-ary PSK and pure multilevel signaling wherein the most 30 
sensitive or least significant bit of the modulation are decoded using calculated probabilities [i&h, p. 373]. The 
estimates for these least significant bits are then fed into a decoder for the L-th error-correcting code, 
generally the most powerful code. The final decision bits from this L* decoder are then used in combination 
with the channel soft-decision information in providing the probability estimates needed for the (L-1)«t 
decoder. The final decision bits from the (L-1)st stage along with the final decisions from the stage are in 35 
turn used in providing the probability estimates needed for the (L-2H decoder, and so forth, until the most 
significant bits are decoded. 

While the description is very incomplete, Sayegh's decoding procedures [syh, p. 1044] appear to be those of 
Imai and Hirakawa adapted to the additional modulations he treats, requiring an optimal decision at each stage 
of a sub-optimal procedure. This decomposition adds a further source of potential loss. If the decoder at some 40 
stage does not provide the correct final decision bits, the estimates used in calculating probabilities for ail 
successive stages will reflect the errors and increase the possibility of decoding error in those later stages. 
However, for high-speed systems the decomposition has the advantage of creating parallel data paths for the 
decoding of the bits of different significance. In the representative circuit shown in Figure 2 of Imai and 
Hirakawa, there are four different decoders for different error-correcting codes ail working simultaneously in 45 
pipelined fashion. As a result, each of the decoders must handle data at only a fraction of the raw input rate. In 
high performance systems, very efficient error-correcting codes must create dependencies that interrelate 
large numbers of digits, requiring either a long block length in the case of algebraic block codes or a long 
constraint length in the case of convolutional codes, As is well known to those acquainted with the coding arts, 
decoding such codes using soft decision information is very complex. Typical implementations for such 50 
decoders either cannot handle high data rates or require large and complex circuits that consume significant 
power. What is needed is a more efficient decoding scheme. 

SUMMARY OF THE INVENTION 

According to the invention, method and apparatus provide combined encoding and modulation which 55 
creates signal sets from available amplitude and phase modulations by indexing into ordered subspaces. The 
ordered subspaces need not be limited to the class of subspaces known as binary subspaces. The resultant 
signal sets, for a preselected power and bandwidth, are widely separated and unlikely to be confused by the 
effects of channel noise. Such signals can be in either finite block or convolutional form, depending on the 
natural format of the desired transmission. Further according to the invention are basic apparatus for encoding 60 
and modulating as well as apparatus for demodulating and decoding a signal in accordance with a 
demodulation/decoding method of the invention. Specifically, a demodulation/decoding method is provided 
for incorporating a known decoding technique that develops accurate estimates of the Information from the 
received signal in a computationally efficient manner and that permits high speed operation using 
soft-decision decoders. &5 
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The invention will be better understood by reference to the following detailed description in conjunction with 
the accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 
5 Figure 1 is a block diagram of at irst prior art error correcting code-based communication system. 

Figure 2 is a block diagram of a second prior art error correcting code-based communication system. 
Figure 3 is a phase diagram (modulation constellation diagram) illustrating a Gray code indexing of 
phase states for 8 PSK modulation. 
Figure 4 is a phase diagram illustrating indexing of phase states for a set partitioning technique. 
10 Figure 5 is a block diagram of a general encoder/modulator structure in accordance with the invention. 

Figure 6 is a constellation diagram illustrating indexing of an 8-AMPM signal in accordance with one 
embodiment of the encoder/modulation technique according to the invention. 

Figure 7 is a constellation diagram illustrating indexing of an 8-AMPM signal in accordance with a 
second embodiment of the encoder/modulation technique according to the invention. 
15 Figure 8 is a phase diagram illustrating indexing of a 6-PSK signal in accordance with a further 

embodiment of the encoder/modulator according to the invention. 

Figure 9 is a flow chart of a generalized decoding/demodulation method implemented in a preferred 
embodiment of a demodulator/decoder implemented in accordance with the invention. 

Figure 10 is a phase diagram illustrating extraction and demodulation of a signal of a prior art decoding 
20 and demodulation method. 

Figure 11 is a phase diagram illustrating extraction and demodulation of a signal according to one 
embodiment of the present invention. 

Figure 12 is a phase diagram illustrating extraction and demodulation of a signal according to a further 
embodiment of the present invention. 
25 Figure 13 is a block diagram of an encoder/modulator structure in accordance with a preferred 

embodiment of the invention. 

Figure 14 is a block diagram of a demodulator/decoder in accordance with a preferred embodiment of 
the invention. 

30 DETAILED DESCRIPTION OF THE INVENTION 

According to the present Invention, a method is provided for making the minimum Euclidean distance in the 
Euclidean space of a communication channel as large as possible, or, equivalent^ and more conveniently, to 
make the squared Euclidean distance as large as possible, rather than concentrating alone on the Hamming * 
distance of the error-correcting code in the vector space of the digital channel. More specifically, modulations 

35 are indexed to form them into ordered subspaces, which include both binary and nonbinary subspaces. The 
subspaces provide more highly disciplined indexing that could otherwise be exploited without further detailed 
knowledge of the structure of the particular modulations. 

Encoding and Modulation Methods and Circuits 

40 To create the tightest linkage between the structure of the digital error-correcting code and the modulation, 
the digital indexing of the possible modulations is crucial. Algebraic error-correcting codes are in almost all 
instances based upon finite fields and can be organized into subspaces. These subspaces are themselves 
error-correcting codes of potentially different correcting abilities. The well-known BCH codes, for example, 
can be viewed as nested, with a larger minimum distance code with more numerous roots being a subcode of 

45 a code with fewer roots and smaller minimum distance. For instance, the words of the (15,1 ,15) BCH code are 
all words in the (15,5,7) code, which are in turn all words in the (15,1 1 ,3) code, which are in turn all words in the 
trivial (15,15,1) code. It has been shown by others that such nested subcode structure could be used to 
construct recursively long and powerful error-correcting codes based on simple codes that form nested 
subspaces. 

50 According to the present invention, however, the method of constructing nested subcodes from 
error-correcting coding theory is adapted to the construction of combined coding and modulation systems. 
The available modulations are indexed by vectors over a finite field in such a way that the additive modulation 
distances are well organized by subspaces and affine (shifted) varieties of those subspaces. 
To illustrate, with reference to Figure 4, consider the indexing of the 8 PSK modulations induced by set 

55 partitioning of the type taught by Ungerboeck. in accordance with the invention, a quanternary phase shift 
keying set with minimum squared Euclidean distance of 2 is formed by fixing the set of modulations formed by 
the last of the three bits at either 0 or at 1. Consequently any pair of modulations differing only by changes 
restricted to the two-dimensional subspace formed by the topmost, the most significant bit, and the center, 
the center significant bit, have a squared Euclidean distance separation of at least 2. Similar.y, if the two 

60 rightmost indexing bits, the center and the least significant bit, are fixed at any one of the four possible values, 
the pair of modulations differing only in the most significant bit have a squared Euclidean distance separation 
of 4. (In the case of 8 PSK, this indexing of modulations is like that created by Imai and Hirakawa, using 
different notation.) * 
Figure 5 illustrates a basic encoder/modulator apparatus 100 according to the invention for exploiting the 

65 organization of the modulations described hereinabove. The encoder/modulator apparatus 100 comprises a 
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first, or Isb, encoder 102, a second, or csb, encoder 104, a third, or msb, encoder 106 and a digit-to-modulation 
mapping subsystem 108 coupled to receive, respectively, the most significant bit (msb), the center significant 
bit (csb), and the least significant bit (Isb) from inputs of information digits. The isb encoder 106 is of the form 
which generates a strong binary error-correcting code with minimum Hamming distance D3. The Isb code is 
the strongest code and therefore has the most impact on decoding. A change in one information bit going into 5 
the Isb encoder 106 causes a number of bits at least equal to the value D3 to be changed going into the 
mapping circuit (a conventional one-for-one table translation from a pure digit to a radial value signal, which in 
turn is applied to an analog signal modulator), which in turn leads to a squared Euclidean distance of at least 
83 s D3, no matter what changes occur in the information bits going into Encoders 1 and 2. If no changes occur 
in the information bits going into Isb encoder 106, then any change in the information bits going into csb 10 
encoder 104 must cause a number of bits equal to at least the value D2 to be changed, which then leads to a 
squared Euclidean distance of at least 82 2 D2, no matter what changes occur in the information bits going into 
msb encoder 102. Finally, if no changes occur in the information fed into Isb or csb encoders 106 or 104, any 
change in the information bits going into msb encoder 102 must cause at least D1 bits to be changed, which 
leads to a squared Euclidean distance of at least 81 2 D1. The minimum squared Euclidean distance separation 15 
of any pair of encoded modulations is thus at least: 

D Z min ( 6 j 2 D 1 ). 



The method practiced by apparatus according to the invention differs significantly from the methods of the 
prior art. For example, unlike both Ungerboeck and Ginzburg, the modulations are organized by subspaces, 25 
rather than by the mathematically more general but practically less useful hierarchy of subsets. The subspaces 
create more highly disciplined indexing that can be exploited without further detailed knowledge of the 
structure of the particular modulations. 

Similarly, while Imai and Hirakawa's construction lead to subspaces in the case of several particular types of 
modulations, the possibility of subspace organization was never recognized and thus no advantage has been 30 
suggested of the flexibility afforded by subspace organization. 

Still further, while Sayegh's constructions (which include 8-AMPM and 16-QASK as well as PSK 
modulations) use the modulation indexings suggested by Ungerboeck, the Ungerboeck's labellings would 
induce one of many possible sequences of nested binary subspaces for the modulations described therein, 
Sayegh failed to recognize or suggest that any equivalent subspace organization is sufficient or that the 35 
subspaces need not be binary in general. 

To understand the flexibility of the subspace organization, suppose that any arbitrary modulation can be 
indexed by subspaces of bits such that changes restricted to si bits forming a Subspace 1 induce a squared 
Euclidean distance of at least 61 2 ; changes restricted to the si bits forming a Subspace 1 or the S2 bits forming 
a Subspace 2 induce a squared Euclidean distance of at least S2 2 ; changes restricted to the si bits forming a 40 
Subspace 1 , the S2 bits forming a Subspace 2, or the S3 bits forming a Subspace 3 induce a squared Euclidean 
distance of at least 83 2 ; and so forth. We assume 8i 2 > 82 2 > 8s 2 > . . .> 8l 2 . Typically such subspace 
indexings of the modulations can be created by using the groups and subgroups of invariances of the set of 
elementary modulations. In general the subspaces need not be over extension fields of the binary field; fields 
of characteristic other than 2 may be appropriate for other signal constellations. Typically, however, extension 45 
fields of the binary field are of the greatest practical importance, and we will focus attention primarily on such 
examples. 

To perform the combined encoding and modulation, a first error-correcting encoder 102 for a code with 
minimum Hamming distance D1, producing symbols consisting of si bits, is used to produce the successive 
sets of si bits to be fed into the most significant subspace lines 103 of the digit-to-modulation mapping 50 
subsystem 108. Similarly, a second error-correcting encoder 104 for a code with minimum Hamming distance 
D2, producing symbols consisting of S2 bits, is used to produce the successive sets of S2 bits to be fed into the 
next most (center) significant subspace lines 105 of the digit-to-modulation mapping subsystem 108. This 
continues until at the bottom, an error-correcting encoder for a code with minimum Hamming distance Dl, 
producing symbols consisting of sl bits, is used to produce the successive sets of sl bits to be fed into the 55 
least significant subspace lines of the digit-to-moduiation mapping subsystem 108. In practice, the frh encoder 
may be either a convolutional encoder or a block encoder for digits in an extension field GF(2*9, or, more 
simply, si copies of a binary error-correcting encoder for a code with minimum Hamming distance D ( (although 
such copies are usually less efficient for the same level of protection). In cases where the Euclidean distance 
separations of the ith subspace mirror perfectly the Hamming distance separations of the digital code (as is the 60 
case for QPSK modulations), the si digits for the subspace may be i successive digits from an 
error-correcting code word. 

With this subspace organization, by choosing the error-correcting codes to have minimum Hamming 
distances that very roughly inversely with the squared Euclidean distance of subspaces, it is possible to create 
combined coding/modulations with very large channel distance separations at relatively high coding rate and 65 
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without bandwidth expansion. Two examples of systems for more subtle modulations will serve to illustrate the 
signal creation method and distinguish the present Invention from the more limited methods of Imai and 
Hirakawa and Sayegh. 

Figure 6 illustrates 8-AMPM modulation as two QPSK modulations 50 and 50' indexed as two subspaces 
5 which are combined with binary unit displacement 52. The resulting indexing 56 is illustrated in the adjacent 
portion of the figure. The X's correspond to the positions defined by the combined quadrature modulations. 
The corresponding index value is a three-digit binary number of an eye pattern. Figure 6 thus shows how 
8-AMPM modulation can be viewed as the combination of two "large grain" QPSK modulations each defined 
on respective two dimensional unit radius circles 50 and 50' shifted by a unit displacement 52 relative to each 
10 other. A "small grain" binary subspace determines which of the two QPSK versions circumscribed by the unit 
radius circles 50 or 50' is sent. If the QPSK signal is indexed to be broken up into two subspaces, one with 
squared Euclidean distance of 2, e.g., between 1 1 1 and 100 (54) and the other with squared Euclidean distance 
4, e.g., between 111 and 110 (56) as shown in the right side of Figure 6, these can be used as the most 
significant bits of a three bit indexing of 8-AMPM. The least significant bit is the value of the binary unit 
15 displacement 52 and is the third bit (the left-most bit of each triplet in the Figure) which determines which of 
the two displaces QPSK versions 50 or 50' is sent. The result is a labeling identical to that of Ungerboeck as 
used by Sayegh. 

Figure 7 shows another indexing scheme according to the invention. Figure 7 illustrates 8-AMPM 
modulation as two Gray code signals constellations 60 and 60' indexed as two subspaces which are combined 

20 with binary unit displacement 62. The resulting indexing 66 is illustrated in the adjacent portion of the figure. 
The X's correspond to the positions defined by the combined quadrature modulations. The corresponding 
index value is a three-digit binary number of an eye pattern. Figure 7 thus shows how 8-AMPM modulation can 
be viewed as large grain QPSK components indexed by a conventional Gray code to form two-dimensional 
subspaces in which the Hamming distance of a binary error correcting code is mirrored exactly by the squared 

25 Euclidean distance of the modulation combined by a binary unit of displacement. As before the choice of 
binary subspace (selected by binary unit displacement 62) determines which of the two versions 60 or 60' is 
transmitted. This indexing 66 is shown in Figure 7, and it is obviously not equivalent to Ungerboeck's indexing 
(equivalent to 56 of Figure 6). With this indexing 66, the two dimensional subspace 60 or 60' can be determined 
by two successive bits from a single binary error correcting code of minimum distance Di. The third subspace 

30 bit can determined by another binary error-correcting code of minimum distance D2. The squared Euclidean 
distance separation of any two signals created by the system is at least: 
D ^ min ( 2 Di, D2 ). 

Because longer error-correcting codes are more efficient than short ones for a given minimum distance, this 
construction in many instances gives more efficient signal sets than does Sayegh's construction and is 
35 therefore a notable advance. 

The method of construction according to the invention suggests other alternatives. For example, a 16-QASK 
can be viewed as the combination of a large grain QPSK with a smaller grain QPSK set of displacements. Each 
of the QPSK signals can be indexed either as two separate subspaces or as a single two-dimensional 
subspace. 

40 Further according to the invention, constructions based on nonbinary subspaces may also be implemented. 
With reference to Figure 8, such a possibility will be illustrated by an unusual modulation, 6-PSK. In Figure 
8 6-PSK is shown with an indexing in which a single ternary digit from GF(3) (values 0,1 or 2) determines each 
3-PSK modulation 70 or 70' and a single binary digit selects which of two possible 3-PSK modulations 70 or 70' 
is used. This leads to a sequence of subspaces consisting of two subspaces: the first is a GF(3) ternary 

45 subspace that selects a signal from a 3-PSK set and the second is a GF(2) binary subspace that selects which 
of two rotated versions of the 3-PSK signal will be used. Using simple trigonometry and assuming a unit radius 
circle, it can be determined that the squared Euclidean distance associated with the ternary subspace is 3 
whereas that for the binary subspace is 1 . Here the squared Euclidean distance separation of any two signals 
created by the system is at least 

50 D ^ min ( 3 Di, D2 )- 

As a concrete instance, the first ternary subspace code could be a ternary Hamming (121,116,3) code while 
the binary subspace could be (1 21 ,93,9) shortened BCH code. In 1 21 transmitted 6-PSK symbols 1 1 6(log2 3) 
-h 93 = 183.85 + 93 - 276.85 bits could be transmitted (as opposed to 2(121) = 242 bits for QPSK) while 
achieving an asymptotic gain of logio(9/2) = 6.53 dB. over uncoded QPSK. 

55 The encoding structure described above provides a natural parallelism that is convenient for high speed 
operation. However, using techniques considered state-of-the-art prior to the present invention, many of the 
signal sets created in this way could not be effectively decoded without the use of enormously complex 
decoding circuitry. A second advance of the present invention is to provide practical techniques for decoding 
the combined coding/modulation signal sets at high data rates and with very practical and cost-effective 

60 circuits. 

Decoding Circuits and Methods 

The present invention makes use of selected Tanner's algorithms as described in the 1981 Tanner article, 
the contents of which is incorporated herein by reference. In the present invention, the decoding/demodula- 
65 tion attempts to use all of the probability information provided by the channel for a sophisticated modulation 
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scheme in an extremely simple fashion. For high-speed performance, the preferred embodiment breaks the 
decoding process into stages corresponding to the decoding of the digits by the same subspaces used in the 
encoding process. 

The work of Imai and Hirakawa also serves as a point of departure for demodulation/decoding, in the 
preferred embodiment of the demodulator/decoder of the present invention, the methods and circuits of Irnai 5 
and Hirakawa are improved in three ways. First, decomposition is organized by general subspaces in 
accordance with the invention and implemented in the encoder/modulator of the present invention, rather than 
by single bits as suggested for the limited set of modulations treated by Imai and Hirakawa. Subspace 
organization according to the invention permits the methods of the present invention to be adapted to virtually 
any modulation scheme. 7# 

Second, in the present invention, circuits suggested in Imai and Hirakawa are replaced by "extractors" 
which, at the ith stage, produce quantized logarithms of the likelihoods: 

Pr( S — Sj ! R received), for each distinct index sj, 
where Sjs the elementary modulation indexed by Sj in the |m component; i.e., the most likely elementary 
modulation is indexed by s i+ i, si +2l . . . , s L , which are the best estimates for the values of the digits of the more 15 
sensitive subspaces produced by the decoder in the previous decoding stages. In contrastjmai and Flirakawa 
used the more complex calculation of a posteriori probabilities with "intermediate estimation circuits". 

Third, and perhaps most importantly, one or more of the decoder circuits is a decoder implementing one of 
the graph-based decoding algorithms of the Tanner article [tan81]. In the preferred embodiment, Algorithm B 
of the Tanner article is used for the first time while supplied with quantized logarithms qf probability ratios by 20 
the "extractor" of that stage. Hence, the Algorithm B is employed with "soft-decision" information. 

Referring to the flow chart of Figure 9, Algorithm B is as follows as adapted for decoding with soft decision 
information according to the present invention. (The algorithm is described with respect to decoding based on 
hard decision information on page 541 of the Tanner paper): 

1 ) Establish a formal indexing for the registers requires (Step A) : Let Rij be the register associated with bit i, 25 
where i - 1 ,2,...,N, which is accessed by subcode processor j, where J m 1,2,...,S. R f j(t3 is the value stored by 

the register Rq after the t*» iteration, and R'y is . a corresponding temporary storage register. Let Vi, where 
i - 1,2,. ..,N, be a register storing a value V r (0) which is of value +1 or-1 if the itn bit was received as a 1 (one) or 
a 0 (zero), respectively. Let Ji be the index set of the subcode processors accessing bit i, and let lj be the index 
set of bits accessed by the subcode processor j. . , so 

2) initialize the registers requires: Load each of the Vi registers with a value provided by the corresponding 
extractor based on information provided by the channel for the ith bit (Step B) . Assign register Rtj(O) the value in 
register Vi(0) for each register Ry, for which j is an element of J f (Step C), * 

3) Perform an iterative loop (Step D) comprising a subcode phase (Step D1) and a bit register phase (Step 

D2) : In the subcode phase (Step D1 ), for each value t from 1 to f, where f is a number of iterations selected as a 35 
matter of engineering choice for accuracy and speed (the end of the loop is determined in Step D3), determine 
temporary value R'y as follows: 

R' . . = 1/2 [ max ( C • R.(t - 1) - R. . (t - 1) 

c i= +1 



- max ( C • R,(t - 1) + R. .(t - 1)] 
C e a -> 13 



C.=-l 



Ri ^ (t) = uj i R ' il ' R '« + Vi(0> 



45 



for each i which is a member of the set lj, where a is the set of vectors derived from ad words in the ^subcode, 

by replacing each one in a code word with a +1 and each zero in a code word with a -1 ; , 50 

C — (Cn,Ci2,...,Cin), each derived from a code word in the subcode by replacing each 1 in the code word by a 

-f- 1 and each 0 by a -1 ; 

Rj(M) is the ordered vector of register values 

[ Rhj(M), 5*0-1), .... Ri ni (M)] 

with ii,i2, in a member of the set lj; and C«Rj denotes a real vector inner product. ...... 55 

If g/2 is odd, and t = 1, then all R (J values are divided by m, the degree of bit nodes in the graph defining the 
code (in order to avoid multiply counting bits at the bottom of the tree). 

In the bit register phase (Step D2), for each i = 1,2,...,N } the registers for the Ith bit are updated as follows: 



60 



3) Make a final decision for the value of the bit (Step E) : Using the jtn subcode processor, find the vector for 
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C which achieves the maximum value for the real vector inner product of the last iteration in the loop and store 
the corresponding component of the maximizing subcode word in the corresponding temporary storage 
register. That is, find: 

maxe eate # Rj(f)L where f is the floor function of (g-2)/4), and set FT u\ = C^. 

5 This will result in the output value of the fa bit being one if the sum of ali R'y, where j is a member of the set J { , is 
greater than zero, and zero if otherwise. In other words, the final value of the itn bit is determined by a majority 
vote of the best final estimates provided by the subcodes checking the fa bit. (Note the difference between the 
index value 1 and the index variable 1.) Alternatively, the number of iterations may be preselected according to 
an engineering preference for precision and speed. A suitable number of iterations for a code of less than 

10 length 500 is three. 

A specific decoder implementing Algorithm B is described in connecting with Figure 13. Figure 13 is 
described hereinbelow. 

The second distinction above will be more easily understood by considering the process of producing 
estimates for the values of the digits of the ithsubspace starting from the most sensitive L* subspace in the 

15 simple case where each subspace is binary. For example, consider the 8-PSK modulation of Figure 4. 
Referring now to Figure 10, using Imai and Hirakawa's method, if vector R is received, to calculate the a 
posteriori probabilities Pr(s3 = 0| R received) and Pr(s3 = 1! R received), (! R received means "given R is 
received" ) all of the eight distances 81-88 from R to each of the elementary modulations 
010,001,111,000,100,011,101,110 shown in Figure 10 must be used. 

20 in the extraction method according to the present invention, (Figure 11) only the two elementary distances, 
81, with modulation S3=0, and distance 82, with modulation S3 = 1, are used, assuming that the channel noise 
density function is a monotonically decreasing function of distance. If the channel noise density function is a 
monotonically decreasing function of distance, as is the case for white Gaussian noise, the two indicated 
distances 81 and 82 will be the shortest distances to the two most likely elementary modulations S3 = 0 and 

25 S3= 1, respectively. For relatively high signal-to-noise ratios, the a posteriori probabilities associated with the 
distances 81 and 82 are dominant in the calculation of the true a posteriori probabilities, and the probability 
calculation is greatly simplified by the use of these two alone. 

Consider an example with reference to Figure 12 and Figure 14 of 8 PSK modulations. (The full structure of a 
demodulator/decoder is described hereinafter). The decoder (306) for the least significant bit is operative to 

30 decide that for example the best estimate for S3, S3, is 0. An extractor (Figure 14 314) according to the present 
invention at the next stage is operative to calculate the logarithm of the likelihood ratio of only the two most 
likely elementary modulations. (Imai and Hirakawa transpose most significant bit and least significant bit 
designations. In Imai and Hirakawa's method, at the next stage the probability computation would have to use 
the probabilities associated with four distances, rather than only two distances 91 and 92.) The extractor may 

35 be specific for the type of noise expected in the channel. For this purpose, the extractor may be a simple 
read-only memory circuit which produces a defined precomputed output in response to an input based on a 
limited number of possible inputs. The results are used as the soft decision information for a decoder (Figure 
14, 308) which produces a best estimate for S2 = S2. In the case of 8 PSK modulations, the estimates S2 and S3 
are used by the third extractor (320) that provides log-likelihood ratios for the decoder (324) determining the 

40 best estimate for si, s-|. Since S2 and S3 are specified (forexample S2 = 1 and S3 = 0), there are only two 
elementary modulations possible si = 0 and si - 1 , as shown in Figure 12, so the third extractor (320) for the 
last stage is operative to calculate the same log-likelihood ratio as would Imai and Hirakawa's circuit, except 
much more efficiently. 

It can be appreciated that for complicated multilevel and multiphase modulations, the use of only the most 
45 likely elementary modulations represents a substantial simplification of the calculation required by the known 
prior art. 

The third advance is the use of decoders based on specific algorithms, and most specifically Tanner's 
Algorithm B above. In the preferred embodiment, the most powerful error-correcting code is applied to the 
digits governing the most sensitive index subspace of the modulation. The graph-based algorithms of Tanner, 

50 particularly Algorithm B perform decoding as multiple iterations of replicated simple primitive operations that 
can be performed in parallel. As a result they can be implemented in large scale integrated circuits which can 
handle extraordinarily high data rates and decode highly efficient codes with soft decision information using a 
relatively small number of integrated circuits. This property makes the algorithms an excellent choice for the 
error-correcting decoders of the present invention, particularly for the most powerful error-correcting code in 

55 high performance systems. 

Figure 13 illustrates a preferred embodiment of an encoder/modulator 200 according to the present 
invention for the particular case of 8 PSK modulation on a white Gaussian noise channel at an operating 
signal-to-noise ratio E s /N 0 of approximately 13.5 dB (output bit error rate of approximately 5 x 10~ 7 ) (compare 
Figure 5). Figure 14 illustrates a preferred embodiment of a demodulator/decoder 300 according to the 

60 present invention for the same type of encoding and modulation. While the encoder/modulator and 
demodulator/decoder shown here are compatible, there is nothing that precludes the use of an 
encoder/modulator of a different design with the demodulator/decoder of the invention or the use of a 
demodulator/decoder of a different kind with an encoder/modulator according to the invention so long as the 
signal information is in a recognized modulation format. 

65 Referring to Figure 13, the strongest code, for the least significant bit or third incremental subspace of the 8 
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PSK indexing, is a length 73 perfect difference set code with 45 information bits and minimum distance 10 
defined by a 73 by 73 circulant parity check matrix with 10 ones per column and 10 ones per row. The code is 
shortened by one bit to (72,44,10) by setting one of the information bits to 0 (encoder 206). The code for center 
significant bit is a [72,63,4] shortened Hamming code formed by using 72 of the weight three columns of the 
parity check matrix of a length 511 Hamming code as the parity check matrix (encoder 204). The code for the 5 
most significant bit is a trivial (72,72,1) code; that is, the informatton is uncoded (line 202). The encoder thus 
has the form shown in Figure 5 and is coupled to a digit-to-modulation mapping subsystem 1 08 wherein 8-P&K 
modulation is applied to an information channel in the conventional manner. (If the order of bit significance is 
reversed, the form of the encoder is readily compared with the ordering of subspaces and presentation of Imai 
and Hirakawa.). 10 

The structure of the demodulator/decoder system 300 according to the invention is shown in Figure 14. The 
input is a signal represented by the time-related function rW, which is an analog input signal or an equivalent 
digital signal representative of the information and noise in the communication channel (for example, a 
sampled data stream of discrete digital values which have a one-for-one correspondence with the analog 
time-domain signal at the terminal of the communication channel). The demodulator/decoder subsystem 300 15 
produces as an output the estimated digital subspace vector values site), S2<ts), and s^), which represent the 
distinct sj values delayed by M d" time units. Specific time delays are as follows: ti » t-di; t2 =» ti -<\z\ 
t 3 = t2- d3 = t-d; where di represents the delay of decoding In accordance with the order of processing for 
a received vector. Several extraction circuits are employed which are operative to calculate log-likelihood 
values as described hereinabove or as elsewhere in the art (for example, in Imai and Hirakawa). 20 

The demodulator/decoder subsystem 300 comprises a first intermediate extraction circuit Ei 302 coupled 
to receive an analog signal or equivalent bit stream rW and to apply a first extracted output signal 304 to a first 
algorithm-specific decoder Di 306. A second algorithm-specific decoder 308 is also provided. The first and 
second algorithm-specific decoders 306 and 308 implement the Tanner Algorithm B (Figure 9) and comprise a 
plurality of fixed registers and working registers (not shown) as needed to execute the algorithm. The 25 
algorithm may be implemented in connection with a digital computer processing system of a standard design 
or of a design especially suited to digital signal processing applications. One application has been described in 
the Chethik et al. paper incorporated herein by reference. 

The output of the first algorithm-specific decoder is a binary digital value s-i(ti) coupled to a first time buffer 
310, the output of which is a binary digital value site) supplied to a second time buffer 312, the output of which 30 
is the desired estimate value of si for the time slot or frame t3. 

The output of the first algorithm-specific decoder 306 is also coupled to a second intermediate extraction 
circuit 314 to which is also coupled an analog or equivalent input signal r(ti), which is the input signal r delayed 
by di time unit via a first input time buffer bi 316. The second intermediate extraction circuit E2 314 processes 
the delayed signal or equivalent bit stream rto) in view of the estimate si(*ii and applies a result in nonbinary 35 
form to the second algorithm-specific decoder D2 308. The output of the second algorithm-specific decoder 
D2 308 is the estimate S2^h which in turn is coupled to be applied to a third time buffer B3 318 and to a third 
intermediate extraction circuit E3 320. The output of the third time buffer B3 318 is the desired output estimate 

S2te). 

The output of the second algorithm-specific decoder 308 and the output of the first algorithm-specific 40 
decoder 306 are also coupled to the third intermediate extraction circuit 314 to which is also coupled an analog 
or equivalent (delayed) input signal r<*a>, which is the input signai r delayed by dr + time units via the first 
input time buffer bi 316 and a second input time buffer b2 322. The third intermediate extraction circuit E3 320 
processes the delayed signal or equivalent bit stream rte) in view of the estimates si<ti) and 52^ and applies the 
result in nonbinary form to a hard decision decoder H 324 for the most significant bit. The output of the hard 45 
decision decoder H 324 is the desired output estimate s^). 

The decoder system 300 operates as follows: The Algorithm B decoder 300 for the least significant bit (Di 
306) loads its fixed registers and working registers (not shown) for each bit with the log-likelihood ratio for that 
bit provided by the extractor (E1 302), each log-likelihood represented by a 6-bit quantity in the exemplary 
implementation. The two-phase correction process of Algorithm B (according to the flow chart of Figure 9) is 50 
repeated for example three times before a best estimate for each bit is produced. The best estimates are fed 
to the extractor for the next stage. The quantized log-likelihood values provided by the extractor for the 
second stage are fed into the second Algorithm B decoder D2 308 for the shortened Hamming code. The 
decoder 308 is structured on a graph for the code in which every bit is checked by exactly three of the nine 
parities. The resultant two estimates produced si and £2 are fed to the third extractor Es 320 for the most 55 
significant bit stage. At this operating point, no coding is required on the most significant bit inasmuch as the 
probability of error of binary PSK is lower than the error rate for the estimates produced by the (72,44,10) 
decoder. Consequently, the extractor 320 output is hard limited by the hard decision decoder 324 to produce a 
hard decision for the most significant bit. The output of the hard decision decoder H 324 is thus the desired 
output estimate §3^), which together with the other values §1 and S2 represent the best estimate of the 60 
information transmitted from the remote data source. 

Simulation studies of this particular preferred embodiment have been conducted. The system achieved 
better than 6 dB of coding gain over uncoded 8 PSK with a Gray code mapping of information to modulations at 
an uncoded bit error rate of 5 x 10~ 7 . The overall coding rate for the system was 
(44 + 63 + 72)7(3x72) - 82.80/0. This compares very favorably with any other system proposed or 65 
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implemented. At the finite bit error rate operating point of the present system and with actual implementation 
losses, a trellis code will achieve less than its asymptotic gain. 

The preferred implementation discussed is only illustrative of the many forms that combined coding and 
modulation systems realizing the advances of the present invention can assume. For example, it should be 
5 understood that the application of the invention to non-binary subspaces includes multidimensional 
subspaces and the term binary subspace as used herein applies to one-dimensional binary subspaces. It is 
therefore not intended that this invention be limited except as indicated by the appended claims. 



10 

Claims 

1. In a digital signal communication system for communicating information through an information 
channel, a method for combined coding and modulation of digital information comprising the steps of: 
15 indexing digital signals representative of elementary modulations by indexing vectors to create a 

decomposition of indexing vectors of an index vector space into a plurality of ordered subspaces, 
including binary and nonbinary subspaces ; 

associating with each said indexing vector a Euclidean distance in modulation space such that any 
two modulations whose indexing vectors differ only by a distance vector contained in a'first subspace and 
20 any preceding (higher significant) subspaces of the series of ordered subspaces are separated in said 

modulation space by at least said Euclidean distance; and 

encoding information signals by encoders employing error-correcting codes, each said encoder 
producing a symbol representative of an indexing vector of the same dimension as a corresponding one 
of said ordered subspaces for communication of said symbol through said information channel. 
25 2. A method according to claim 1 wherein said error-correcting codes are finite length block codes. 

3. A method according to claim 1 or 2 wherein said error-correcting codes are continuous overlapping 
convolutional codes. 

4. A method of encoding the modulated signals according to claim 1, 2 or 3 wherein said encoders 
encode concurrently to produce redundant indexing vectors, further including the step of providing said 

30 redundant indexing vectors to a modulator mapping subsystem concurrently along parallel data paths. 

5. In a digital signal communication system for communicating information through an information 
channel, an apparatus for combined coding and modulation of digital information comprising: 

means for indexing digital signals representative of elementary modulations by indexing vectors to 
create a decomposition of indexing vectors of an index vector space into a plurality of ordered 
35 subspaces, including binary and nonbinary subspaces ; 

means coupled to said indexing means for associating with each said indexing vector a Euclidean 
distance in modulation space such that any two modulations whose indexing vectors differ only by a 
distance vector contained in a first subspace and any preceding (higher significant) subspaces of the 
series of ordered subspaces are separated in said modulation space by at least said Euclidean distance; 
40 and 

encoding means coupled to said associating means employing error-correcting codes, each said 
encoding means for producing a symbol representative of an indexing vector of the same dimension as a 
corresponding one of said ordered subspaces for communication of said symbol through said 
information channel. 

45 6. An apparatus according to claim 5 wherein said error-correcting codes are finite length block codes. 

7. An apparatus according to claim 5 or 6 wherein said error-correcting codes are continuous 
overlapping convolutional codes. 

8. An apparatus according to Claim 5, 6 or 7 wherein said encoding means encode concurrently to 
produce redundant indexing vectors and further including a modulator mapping subsystem coupled to 

50 said encoding means, said modulator mapping subsystem for receiving said redundant indexing vectors 

concurrently along parallel data paths. 

9. An apparatus for encoding and modulating digital information to be transmitted as a signal in an 
information channel comprising: 

a first encoder; 
55 at least a second encoder; and 

a digit-to-modulation mapping subsystem coupled to receive encoded information digits from 
respective outputs of said first encoder and said at least second encoder; 
i wherein said first encoder generates a strong error-correcting code from a set of possible codes 

j including nonbinary code, with minimum Hamming distance between encoded information digits such 

60 that a change in one information digits applied to said first encoder causes a number of digits at least 

equal to the value of said minimum Hamming distance to be changed for application to said mapping 
subsystem, with a squared Euclidean distance of at least § 2 D, where £ is the Euclidean distance and D 
is the Hamming distance, independent of any changes which occur in the information digits applied to 
said at least second encoder. 
65 10. A combined decoding/demodulation apparatus for recovering digital information in the form of 
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indexing vectors in ordered subspacesfrom an information channel comprising: 

means for receiving noise and moduiations through the information channel, said receiving means 

being suited to reception of modulations introduced into said information channel including more than 

pure phase shift keying and pure multilevel signaling ; 

means coupled to said receiving means for estimating transmitted information from the received 5 

signal, said estimating means including successive stages of circuitry for extracting information 

pertaining to probability for a value of an indexing vector of a last subspace in a series of ordered 

subspaces not yet estimated as a function of estimated values of indexing vectors in all previously 

decoded subspaces; and 

means coupled to said estimating means for decoding the value of said modulations as an indexing w 
vector value for an error-correcting code associated with each successive subspace thereby to produce 
estimates for each indexing vector through a last indexing vector of a last subspace. 

11. An apparatus according to claim 10 wherein at least one of said decoding means includes means for 
computing a plurality of subcodes and means for selecting from said plurality of subcodes a best estimate 

of a digit value for a received modulation. 15 

12. An apparatus according to claim 11 wherein one of the decoding means comprises a Tanner 
Algorithm B decoder employing soft decision information. 

13. An apparatus according to claim 10, 11 or 12 wherein said extraction means comprises means for 
producing decoder input information as a function only of a posteriori probabilities for each most likely 
elementary modulation indexed by each possible indexing vector for its associated subspace in order and 20 
also indexed by estimated vectors produced by the decoders for each preceding ordered subspace. 

14. in a digital signal communication system for communicating information through an information 
channel, a method for combined decoding/demodulation to recover digital information in the form of 
indexing vectors in ordered subspaces from said information channel, said method comprising the steps 

of: ~ f 25 

receiving noise and modulation through said information channel, said modulations introduced into 

said information channel including more than pure phase shift keying and pure multilevel signaling; 

estimating transmitted information from the received signal in successive stages for extracting 

information pertaining to probability for a value of the indexing vector of a last subspace In a series of said 

ordered subspaces not yet estimated as a function of estimated values of the indexing vectors in all 30 

previously decoded subspaces ; and 

decoding said modulation value as an indexing vector value for an error-correcting code associated 

with each successive subspace thereby to produce estimates for each indexing vector through a last 

indexing vector of a last subspace. 

15. A method according to claim 14 further including the step of computing a plurality of subcodes and 35 
selecting from said plurality of subcodes a best estimate of a digit value. 

16. A method according to claim 15, wherein the decoding step includes applying a Tanner Algorithm B 
decoding method with soft decision information to at least one said indexing vector estimate. 

17. A method according to claim 14, 15 or 16, wherein said extraction step includes producing decoder 
input information which is a function only of a posteriori probabilities for each most likely elementary 40 
modulation indexed by each possible indexing vector for its associated ordered subspace and also 
indexed by estimated vectors produced by the decoders for each preceding ordered subspace. 

18. In a digital signal communication system for communicating information through an information 
channel, a method for combined decoding/demodulation to recover digital information in the form of 
indexing vectors in ordered subspaces from said information channel, said method comprising the steps 45 
of: 

receiving noise and modulations through said information channel, said moduiations introduced into 
said information channel; 

estimating transmitted information from the received signal in successive stages and producing 
decoder input information which is a function only of a posteriori probabilities for each most likely 50 
elementary modulation indexed by each possible indexing vector for its associated ordered subspace 
and also indexed by estimated vectors produced by the decoders for each preceding ordered subspace 
for extracting information pertaining to probability for a value of the indexing vector of a last subspace in a 
series of said ordered subspaces not yet estimated as a function of estimated values of the indexing 
vectors in all previously decoded subspaces ; and 55 

decoding said modulation value as an indexing vector value for an error-correcting code associated 
with each successive subspace wherein the decoding step includes applying a Tanner Algorithm B 
decoding method with soft decision information to at least one said indexing vector estimate thereby to 
produce estimates for each indexing vector through a last indexing vector of a last subspace. 
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